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Abstract 



Constructions of woven graph codes based on constituent block and convolutional codes are studied. It is shown 
O . that within the random ensemble of such codes based on s-partite, s-uniform hypergraphs, where s depends only on 

the code rate, there exist codes satisfying the Varshamov-Gilbert (VG) and the Costello lower bound on the minimum 
distance and the free distance, respectively. A connection between regular bipartite graphs and tailbiting codes is 
shown. Some examples of woven graph codes are presented. Among them an example of a rate R wg = 1/3 woven 
graph code with df rcc = 32 based on Heawood's bipartite graph and containing n = 7 constituent rate R c — 2/3 
convolutional codes with overall constraint lengths v c = 5 is given. An encoding procedure for woven graph codes 



with complexity proportional to the number of constituent codes and their overall constraint length v c is presented. 

Index terms — Convolutional codes, girth, graphs, graph codes, hypergraphs, LDPC codes, tailbiting codes, woven 
00 codes. 

o : 

> ! I. Introduction 

^ . Woven graph codes can be considered as a generalization of low-density parity-check (LDPC) block codes [1]. 
^ \ Their structure as graph codes makes them suitable for iterative decoding. Moreover, the LDPC block codes are 
known as codes with low-complexity decoding and they can be considered as competitors to the turbo codes 
[2] which are sometimes called parallel concatenated codes. As mentioned in [3], the underlying graph defines a 
permutation of the information symbols which resembles the interleaving in turbo coding schemes. 

On the other hand, similarly to the LDPC codes, graph codes usually have minimum distances essentially smaller 
than those of the best known linear codes of the same parameters. At a first glance, the minimum distance of a graph 
code does not play an important role in iterative decoding since the error-correcting capability of this suboptimal 
procedure is often less than that guaranteed by the minimum distance. However, in general, the belief-propagation 
decoding algorithms work better if the girth of the underlying graph is large, that is, if the minimum distance of 
the graph code is large [4]. 

In the sequel we distinguish between graph, graph-based, and woven graph codes. We say that a graph code is a 
block code whose parity-check matrix coincides with the incidence matrix of the corresponding graph. Graph-based 
codes constitute a class of concatenated codes with constituent block codes concatenated with a graph code (see, 
for example, [3]). Each vertex in the underlying graph corresponds to a constituent block code. The main feature 
of these codes is that the block length of their constituent block codes coincides with the degree of the underlying 
graph. 
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We introduce woven graph codes which are, in fact, graph-based codes with constituent block codes whose block 
length is a multiple of the graph degree c, that is, their block length is Ic, where I is an integer. In particular, when 
I tends to infinity we obtain convolutional constituent codes. 

Distance properties of bipartite graph-based codes with constituent block codes were studied in [3]. It was 
shown that if the minimum distance of the constituent block codes is larger than or equal to 3, then there exist 
asymptotically good codes with fixed constituent codes among these graph-based codes. Also it was shown in [3] 
that for some range of rates, random graph-based codes with block constituent codes satisfy the VG bound when 
the block length of the constituent codes tends to infinity. One disadvantage of graph-based codes that becomes 
apparent in the asymptotic analysis is that good performances can only be achieved when the block length of the 
constituent block codes (which in this case coincides with the graph degree c) tends to infinity. In practice this 
leads to rather long graph-based codes with not only rather high decoding complexity of the iterative decoding 
procedures but also high encoding complexity. 

In this paper, we consider a class of the generalized graph-based codes which we call woven graph codes with 
constituent block and convolutional codes. They are based on s-partite, s-uniform hypergraphs. Notice that graph- 
based codes with constituent block codes based on hypergraphs were considered in [5], [6]. It is mentioned in [5] 
that Gallager's LDPC codes are graph codes over hypergraphs. 

We consider first woven graph codes with constituent (Ic, lb) block codes. A product-type lower bound on the 
minimum distance of such codes is derived. In order to analyze their asymptotic performances we modify the 
approach used in [3] to s-partite, s-uniform hypergraphs and constituent (Ic, lb) block code^- It is shown that when 
I grows to infinity in the random ensemble of woven graph codes with binary constituent block codes we can find 
s > 2 such that there exist codes satisfying the VG lower bound on the minimum distance for any rate. 

In order to generalize the asymptotic analysis to woven graph codes with constituent convolutional codes we 
assume that the binary constituent block code is chosen as a zero-tail (ZT) terminated convolutional code and 
consider a sequence of ZT convolutional codes of increasing block length I. It is shown that when the overall 
constraint length of the woven graph code tends to infinity in the random ensemble of such convolutional codes 
we can find s > 2 such that there exist codes satisfying the Costello lower bound on the free distance for any rate. 

We also describe the constituent convolutional codes as block codes over the field of binary Laurent series [8]. 
This description as well as the notion of block Hamming distance [9] of convolutional codes is used to derive a 
product-type lower bound on the free distance of woven graph codes with constituent convolutional codes and to 
construct examples of such woven codes with rate R wg = 1/3. For a given hypergraph the free distance of the 
woven graph code depends on the numbering of code symbols associating to the hypergraph vertices. By a search 
over all possible permutations of the constituent code we found an example of a rate R wg = 1/3 woven graph 
code with overall constraint length v = 64 and free distance df iee = 32. The rate R wg = 1/3 woven graph code is 
based on Heawood's bipartite graph [10], [11] and contains constituent convolutional codes with overall constraint 
length v c = 5 and free distance df ree = 6. 

We consider also the encoding problem for graph and woven graph codes. The traditional encoding technique for 
graph codes has complexity 0(N 2 ), where N is the blocklength. We show by examples that some regular block 
graph codes are quasi-cyclic and thereby can be interpreted as tailbiting (TB) codes (see, for example, [12], [13]). 
It is known that the encoding complexity of such codes is proportional to the overall constraint length of the parent 
convolutional code. 

By using a TB representation for the graph code we can construct an example of an encoder for a woven graph 
code that is also represented in the form of a TB code but with overall constraint length less than or equal to 2nv c , 
where n is the number of constituent convolutional codes with overall constraint length v c each. 

In Section II, we consider some properties of s-partite, s-uniform, c-regular hypergraphs. We define woven graph 
codes with constituent block codes as well as with constituent convolutional codes and obtain product-type lower 
bounds on their minimum and free distances. Then, in Section III, we derive a lower bound on the free distance of 
the random ensemble of woven graph codes. In Section IV, examples of woven graph codes are given. We conclude 
the paper by considering encoding techniques for graph codes and woven graph codes in Section V. 

'When we were preparing this paper we were informed that the possibility of achieving the VG bound by considering hypergraphs was 
known to A. Barg [7]. 
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II. Preliminaries 

A hypergraph is a generalization of a graph in which the edges are subsets of vertices and may connect (contain) 
any number of vertices. These edges are called hyperedges. A hypergraph is called s-uniform if every hyperedge 
has cardinality s or, in other words, connects s vertices. If s = 2 the hypergraph is simply a graph. The degree 
of a vertex in a hypergraph is the number of hyperedges that are connected to (contain) it. If all vertices have the 
same degree we say that this is the degree of the hypergraph. The hypergraph is c-regular if every vertex has the 
same degree c. 

Let the set V of vertices of an s-uniform hypergraph be partitioned into t disjoint subsets Vj, j = 1, 2, . . . , t. A 
hypergraph is said to be t-partite if no edge contains two vertices from the same set Vj, j = 1, 2, . . . , t. 

In the sequel we consider s-partite, s-uniform, c-regular hypergraphs. Such a hypergraph is a union of s disjoint 
subsets of vertices. Each vertex has no connections in its own set and is connected with s — 1 vertices in the other 
subsets. In Fig. Q] a 3-partite, 3-uniform, 4-regular hypergraph is shown. It contains three sets of vertices. They are 




Fig. 1. A 3-partite, 3-uniform, 4-regular hypergraph. 

shown by triangles, rectangles, and ovals, respectively. There are no edges connecting vertices inside any of these 
three sets. The vertices are connected by hyperedges each of which connects three vertices. 

A cycle of length L in the hypergraph is an alternating sequence of L + 1 vertices and L hyperedges where all 
vertices are distinct except the initial and the final vertex, which coincide, and all edges are distinct. The girth of a 
hypergraph is the length of its shortest cycle. In Fig. |2] we show a subgraph that contains the shortest cycle of the 
3-partite, 3-uniform, 4-regular hypergraph in Fig. [T] It consists of the vertices 5, 10, and 5 and has girth equal to 2. 
We introduce the notion of a compact (> d)-connected subgraph in the hypergraph. It is a connected subgraph in 
which each vertex is incident with at least d hyperedges. We call the length (number of hyperedges) of the shortest 
compact subgraph its (s,d)-girth. In Fig. [2] the hyperedges belonging to the shortest (> 2)-compact subgraph are 
marked by circles. It is easy to see that (3,2)-girth is 6. 

A 2-partite, 2-uniform hypergraph is a bipartite graph. For such a hypergraph the (2,2)-girth is equal to the girth 
and a compact subgraph is a cycle. Heawood's bipartite graph [10], [11] with 14 vertices and 21 edges is shown in 
Fig. |3] This graph contains a set of n = 7 black and a set of n = 7 white vertices. Each vertex has no connections 
within its own set and is connected with c = 3 vertices from the other set. The girth of the Heawood graph is 6. 



A. Graph-based codes and graph codes 

In order to illustrate the structure of a binary graph-based block code with constituent block codes we represent 
the Heawood bipartite graph using a so-called Tanner graph [15] as shown in Fig. |4] 

We introduce a set of nc = 21 (variable) vertices which correspond to the code symbols. Each of the 2n = 14 
(constraint) vertices on the right- and left-hand sides corresponds to one of 14 parity checks. The c = 3 edges 




leaving one constraint vertex correspond to a codeword of the constituent (c, b) block code of rate R c 
parity-check matrix of the corresponding graph-based code with binary constituent block codes is 



H gh = 



Hi 
H 2 



b/c. The 



(1) 



where the parity-check matrix H\ of size n x nc = 7 x 21 has the form 

/ H c 0000 \ 



H c 



V 







H c J 



where H c is a size (c — b) x c = (3 — b) x 3 parity-check matrix of the constituent block code, and Hi is a size 
nxnc = 7x21 parity-check matrix which is the permutation of the columns of H\ determined by the graph. 
Notice that in general by choosing b < c and assigning constituent block codes of different rates R c = b/c to 
the same graph we can obtain graph-based codes of different rates. In general, since in an s-partite, s-uniform, 
c-regular hypergraph the total number of parity checks is equal to sn(c — b), the code rate i? g b of the graph-based 
code is 

n(c — s{c — b)) 



Rgb > 



nc 



s(R c - 1) + 1 



(2) 



with equality if and only if all parity-checks are linearly independent. If s = 2, then we get i? g b > 2R C — 1. 



Fig. 4. A Tanner graph ( c = 3, n = 7) representation of Heawood's bipartite graph. 



The simplest example of a Heawood graph-based code can be obtained by choosing as constituent block codes 
a single-parity-check code of rate R c = 1/3. Then the parity-check matrix H c has the form 

H c = ( 1 1 1 ) 

and the parity-check matrix of the graph-based code is 
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In this case the graph-based code coincides with the graph code since (0) is the incidence matrix of the Heawood 
graph. In [4] it is proved that the minimum distance of the bipartite graph-based code with single-parity-check 
constituent codes is d m i n = g, where g is the girth of the corresponding graph. Notice that for the Tanner graph we 
have d m in = g/2. The parity-check matrix (0 is a 14 x 21 parity-check matrix. Taking into account that one check 
is linearly dependent on the other, we obtain a (21,8) binary block code. Its minimum distance is d m - m = g = 6. 
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Consider the hypergraph shown in Fig. [TJ Its incidence matrix has the form 
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and is a 12 x 16 parity-check matrix of a hypergraph-based code which coincides with the parity-check matrix of 
the hypergraph code. Each column represents a hyperedge and each row represents a vertex of this hypergraph. 
For example, the first four rows represent the vertices 1, 2, 3, and 4 (triangles), the next four rows he vertices 5, 6, 
7, and 8 (rectangles), and the last four rows the vertices 9, 10, 11, and 12 (ovals). The first column represents the 
hyperedge which connects the vertices 1, 5, and 9, the second column the hyperedge connecting vertices 1, 6, and 
10 etc. The rows of (@]) are linearly dependent. By removing two parity checks we obtain a (16,6) linear block 
code with the minimum distance d m - m = gs,2 = 6, where g^i is the (3,2)-girth of the hypergraph. The rate of this 
hypergraph code is R^ g = 3/8, which satisfies inequality ©, 

i? hg >3^-l)+l = I 
The Tanner version of this hypergraph is shown in Fig. [5] 




Fig. 5. A Tanner graph representation of the (16,6) hypergraph-based code. 
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For an s-partite, s-uniform, c-regular hypergraph-based code with constituent block codes we have the following 
theorem. 

Theorem 1: The minimum distance of a hypergraph-based code based on an s-partite, s-uniform, c-regular 
hypergraph with (s,d^ in )-girth g s>d . 
is 



and containing constituent block codes with minimum distance d^ nin > 2 



Proof. Any nonzero codeword in an s-partite, s-uniform, c-regular hypergraph-based code always corresponds to a 
connected (> d^ in ) -subgraph or a set of disjoint connected subgraphs. These subgraphs are called active [14], [4]. 
All hyperedges and vertices in an active subgraph are also called active. The number of hyperedges in the shortest 
connected subgraph is equal to g s ^<= ■ Any nonzero symbol in a codeword corresponds to an active hyperedge in 
the graph. By using the arguments given above, we conclude that for any codeword v, 

where W]i(v) is the Hamming weight of v. Minimizing over v completes the proof. 

B. Woven graph codes with constituent block codes 

Now assume that the constituent code assigned to the hypergraph vertices is a binary (Ic, lb) linear block code 
determined by a parity-check matrix 



H. 



H 12 
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2,c 



H'< 



(5) 



l (c-fc),l - I - 1 (c-6),2 ••• -"(c-b^c / 

where Hf- £ B\ x i is a size / x / matrix, B[ X i is the set of all possible binary matrices of size / x /. 

Let C2(H C ) denote such a binary (lc,lb) constituent block code determined by the matrix (f5]). We call the 
corresponding hypergraph-based code with C2(H C ) as constituent codes a woven graph code with constituent block 
codes. 

Consider an example of a woven graph code based on the bipartite graph with girth 5 = 4 shown in Fig. [6] The 
Tanner version of this the so-called "utility" bipartite graph is shown in Fig. |7J 




Fig. 6. Utility bipartite graph. 

The incidence matrix of this graph is 
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Fig. 7. A Tanner graph representation of the utility bipartite graph. 

We use a constituent (4x3,4x2) linear block code with d min = 3 determined by the parity-check matrix 

H c = {H c x HI HI) 

/lOOOjlllOjllOOX 
OlOO'0 111'OllO 

ooio|ioii|ooii 
\oooi;iioi;iooi/ 

By searching over all possible permutations of the matrices H\, H%, and H^ we found the following parity-check 
matrix of the woven graph code with the best minimum distance 
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(7) 



The matrix ([7]) describes a (36, 12) linear block code with d m i n = 10. 

Any codeword v c of the (Zc, lb) constituent block code can be represented as a sequence of c blocks of length 
I, that is, v c = (v±, v%, ■ ■ ■ , v%), where v\ = (v^ l ,vf 2 , • • • , v^), i = 1, 2, . . . , c. We define the minimum Hamming 
block distance between the codewords v c and v c of the constituent block code as 

dblock = mjn {w b lock(v c - v c )} 

where wl lock (v c ) = #(uf ^ 0), i = 1, 2, . . . , c. Next we will prove the following theorem. 

Theorem 2: The minimum distance of woven graph codes based on s-partite, s-uniform, c-regular hypergraphs 
with (s,d^ ocfc )-girth g s ^<> k and containing constituent block codes with minimum distance d^ in and minimum 
block distance d^ lock > 2 can be lower-bounded by 

dmm > max 1 9s ' db '°- k ; s | e^. n . 
Proof. Any nonzero codeword corresponds to an active connected subgraph or a set of disjoint connected 
subgraphs and the number of hyperedges in the shortest subgraph is g s ^ . Any nonzero symbol in a codeword 
activates a hyperedge in the graph, that is, not less than s constituent subcodes correspond to a codeword. Since at 
most c hyperedges are connected with any hypergraph vertex then the number of active constituent subcodes can 
be lower-bounded by 

9s,dj lock 
c 

Taking into account that any codeword of block weight greater than or equal to dl lock in the constituent block code 
has a weight at least equal to d^ in we obtain the following inequality 

w u (v) > max j ga '^'° cfc ,sj (i min 

for any codeword v and the proof is complete. 

From Theorem [2] for the woven graph code determined by (0 we obtain that cZ m i n > max ||, 3} 3 = 9. 
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C. Woven graph codes with constituent convolutional codes 

Woven graph codes with constituent convolutional codes can be considered as a straightforward generalization of 
the woven graph code with constituent block codes. Assume that the C2{H C ) code is chosen as a zero-tail terminated 
(ZT) convolutional code and consider a sequence of ZT convolutional codes with increasing Z. It is evident that 
when I tends to infinity the (7c, lb) constituent code C2{H C ) can be chosen as a rate R c = b/c binary convolutional 
code with constraint length v c . Then the corresponding woven graph code has rate R = s(R c — 1) + 1 and its 
constraint length is at most snv c . 

Another description of woven graph codes with constituent convolutional codes follows from the representation 
of the constituent convolutional code in polynomial form. Let G C (D) be a minimal encoding matrix [8] of a rate 
R c = b/c, memory m c convolutional code, given in polynomial form, that is, 

/ 9ii(D) ... g c lc (D)\ 



G C (D) 



(8) 



WiP) ••• 9 c bc (D)J 



where gfj(D) = gfp + g^f D + g^ D 2 -\ h gf^D m , i = 1, 2, . . . , b, j = 1, 2, . . . , c, are binary polynomials 

such that m c = maxjjjdeg gfj(D)}. The overall constraint length is v c = J2i maxj{deg g^(D)}. The binary 
information sequence u c (D) = {u\{D), Ur,{D), . . . ,u%(D)) is encoded as 

v c (D) = u c (D)G c {D) 

where v c (D) = (v\{D), v%(D), . . . , v^(D)) is a binary code sequence. Let H C (D) denote a parity-check matrix 
for the same code, 

(h c u (D) ... 
H C (D)= : 

\Ki(D) ... 

where r = c — b is the redundancy of the constituent code. 

We denote by F2((-D)) the field of binary Laurent series and regard a rate R c = b/c constituent convolutional 
code as a rate R c = b/c block code C c over the field of binary Laurent series encoded by G C (D). Then its codewords 
v c (D) are elements of F2((-D)) c , which is the c-dimensional vector space over the field of binary Laurent series 
[8]. 

The minimum Hamming block distance between the codewords Vj(D) and v^{D) is defined [9] as 




(9) 



dbiock = , min {w h i oc k(vj(D) - v k (D))} 

Vj(D)^v k (D) 

where w block (t>(Z))) = #(vi(D) ^ 0) is the Hamming (block) weight of v(D) = (vi(D), V2(D), . . . ,v c (D)). 

Representing a convolutional code as a block code over the field of binary Laurent series we can obtain a woven 
graph code with constituent convolutional codes as a generalization of a graph-based code with binary constituent 
block codes. For example, a parity-check matrix H wg (D) of the rate R wg = 4/3 — 1 = 1/3 Heawood's graph-based 
code with R c = 2/3 constituent convolutional codes has the form 
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where h\ and t\ are short-hand for hf(D) and t°(D), respectively, and H C (D) = {h\{D) h^D) /13(D)) is a 
parity-check matrix of the rate R c = 2/3 constituent convolutional code and (i£(-D), t%(D), t^(D)) is one of six 
possible permutations of h\{D), h^D), h^(D). 

Exploiting the above definitions we can interpret this bipartite woven graph-based code with constituent con- 
volutional codes as follows. The left column of vertices in Fig. @] represents n parity checks each of which 
determines one of n constituent fixed and identical convolutional codes and their nc branches represent the elements 
v!jj(D) £ F2 ((£))), i even, < i < 2n — 2, 1 < j < c. Similarly, the right column of vertices represents same 
convolutional codes and their nc branches represent the elements vf^(D) £ ~F2((D)), i odd, 1 < i < 2n — 1, 
1 < J < c, where the set {uf? (D)} is a random permutation of the set {^(Z))} determined by the graph. 

We can also regard the n left constituent convolutional codes as a warp with nc threads. Each of the n right 
constituent convolutional codes are tacked on c of the threads in the warp such that each thread of the warp is 
tacked on exactly once. Thus, our construction is a special case of a woven code [17] and we call this graph-based 
code a woven graph code. 

Theorem 3: The free distance of a woven graph code based on an s-partite, s-uniform, c-regular hypergraph with 
the (s,d^ ocA .)-girth 9s,di lock and containing constituent convolutional codes with free distance df ree and minimum 
block distance d\,\ock > 2 can be lower-bounded by 

dfree > max 1 9s4b ^ k } s I d c ircc . 

Proof. Since woven graph codes with constituent convolutional codes can be considered as a generalization of 
woven graph codes with constituent block codes, the theorem follows from Theorem [2] when I tends to infinity. 

For a woven graph code based on a bipartite graph with girth g and containing constituent convolutional codes 
with minimum block distance dl lock = 2 and free distance c?£ ree by a straightforward generalization of the approach 
of [4] we obtain the following tighter bound on the free distance 

diree > max {-, 2 j df ree . (1 1) 

III. Asymptotic bounds on the minimum distance of woven graph codes 

We will show that the ensemble of random woven graph codes based on random s-partite, s-uniform, c-regular 
hypergraphs with a fixed degree c and with a fixed number of vertices n in each subgraph contains asymptotically 
good codes. In order to prove this we will modify the approach in [3]. 



A. Woven graph codes with constituent block codes 



First we consider the ensemble of random woven graph codes with rate R c = b/c constituent block codes 
determined by the edges of a random s-partite, s-uniform, c-regular hypergraph corresponding to the time-varying 
random parity-check matrix 

(HA /7ri(ffl)\ 
#2 7T 2 (# 2 ) 



H„ 



\HsJ 



Vs(H s )J 



where Hi = iii(Hi), i = l,2,...,s, is a block matrix of size nc(l — R c 
n(c — b)l x ncl) and 7Tj denotes a random permutation of the columns of Hi, 

(h^ ... \ 



H 







H 





c(2) 







V 



(12) 



x nc (or a binary matrix of size 



(13) 



Hf n) J 



where H^\ t = 1, ... ,n, denotes the random parity-check matrix (f5]) which determines the (lc,lb) constituent 
block code and n is the number of constituent codes in each subgraph. 
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Remark: In [3] a more restricted ensemble of random codes is studied in which all matrices are identical random 
matrices. In the proof of Theorem 1 we need that the syndrome components are independent random variables in 
the product probability space of random matrices and random permutations. The following simple example shows 
that this is not always the case if all matrices are identical. 

Consider n = 1 constituent block codes of block length c = 2 with 6 = 1 information symbols. This example 
is rather artificial since the rate of the constituent block code R c = 1/2 and therefore the rate of the graph-based 
code with s = 2 is R wg = s(R c — 1) + 1 = 2R C — 1 = 0. In this case the parity-check matrix of the code has the 
form 

H ^ = ( 7t{H 2 ) ) 

where ir is a random permutation of c elements. First assume that all matrices are identical, that is, Hi = H 2 . 
There are only 8 equiprobable elements in the product space, namely, 

{Fwg} = { ( ) ' ( ) ' ( 1 ) ' ( 1 ) ' 

(!S)-(S!)-(i !)•(!!)}■ 

For any vector x of weight 1 we have the following set of random equiprobable syndromes: 

{xH^} = {( ),( ),( ),( 1 ), 
(1 !),(! 0),(1 !),(! 1)}. 

Therefore, 

P{xHl g = 0\w n {x) = 1) = | > |. 
If Hi and H 2 are both random and independent this probability is equal to 1/4. 

Although this remark contradicts the proof of Theorem 3 in [3], there exists another (combinatorial) way to prove 
the same statement for identical Hi [16]. 



Next we prove the following theorem. 

Theorem 4: (Varshamov-Gilbert lower bound) For any e > 0, some Iq > 0, some integer s > and for all 
I > Iq in the random ensemble of length ncl woven graph codes with (Ic, lb) binary block constituent codes of rate 
R c = b/c there exist codes of rate R wg = s(R c — 1) + 1 such that their relative minimum distance 5 wg = d m - m /ncl 
satisfies the inequalities 

r (5(R wg )-e, if R wg > l + s\og 2 (l-5v G (R wg )) 

wg " 1 S VG (R wg ) - e, if R wg < 1 + s log 2 (l - <5v G («wg)) K } 

where 5(R wg ) is a root of the equation 

(1 - s)h(S) - 5slog 2 (V(^w g -i)/s _ i^j = o 

and 5yc(R wg ) is the solution of h(5) + R wg — 1 = 0, and h(-) denotes the binary entropy function. 

Proof. Let w be the Hamming weight of the codeword v of the random binary woven graph code C 2 (H wg ). We 
are going to find a parameter d such that the probability P(vH^ g = 0\w) tends to for all w < d. We can rewrite 
P(vH^ g = OH as 

P(vH? g = OH = 5>(vi^ g = 0\w,j)P(J\w) (15) 

3 

where j = (ji,j 2 ,---,j s ) and ji denotes the number of nonzero constituent codewords in the ith subgraph 
corresponding to the codeword of weight w. 

In the ensemble of random parity-check matrices H c( ^ , t = 1, 2, . . . , n, of size Zc(l — R c ) x Ic the probability 
that a nonzero vector v c is a codeword of the corresponding constituent random binary code C 2 (H C ) is equal to 
2~(c-6)/ s j nce syndromes of the constituent codes are equiprobable sequences of length (c — b)l. Taking into 
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account that in the ith subgraph we have % nonzero constituent codewords the probability P(vH^ g = 0\w,j) can 
be upper-bounded by 

P(vH? g = 0\w,j) < f[ 2 -^-n°). (16) 

i=l 

In order to estimate the probability P(j\w) we prove the following lemma. 

Lemma 1: For the ensemble of binary woven graph codes with constituent block codes described in Theorem [141 
the probability P(j\w) that a codeword of weight w contains j = (Ji,j2, ■ ■ ■ ,js) nonzero constituent codewords 
in the s subgraphs can be upper-bounded by 

nj\ W )<u [j ' )[w/ ;i^ - ™ 

1=1 \ w ) 

Proof. Taking into account that in the ith subgraph the number of nonzero component codewords is equal to ji and 
that the subgraphs are random and independent we can rewrite the probability P(j\w) as 

s 

8=1 

The probability P (ji\w) can be upper-bounded as 

where Hi(v,w,ji) = {Hi \ vHf = 0,w,ji}. The cardinality of Hi(v,w,ji) can be upper-bounded as 



^)(4)%w) 

where the sum is upper-bounded by the maximal term times the number of terms (J J . ■ 
Notice that in the above derivations we ignored the fact that w/ji can be noninteger since we consider the 

asymptotic behaviour of (fT5l) . 
It follows from Lemma 1 that 

^-<r-rn^G)(^)*C;:D 
^„ +1) .(t) 1 -*n. T ----G")(4)'G:I) 

Consider the asymptotic behaviour of (fT5l ) when m tends to infinity. Introduce the notations 7 = j/n and 
5 = w/{ncl) and the function 

_ lim iog 2 P(^ = QH 

l^oo nlc 
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After simple derivations we obtain 

F{5) < F{8) = \ max (1 - s)h(5) - (1 - i? wg ) 7 + sjh (- ) } (20) 
l7£(0,i] V7/J 

where R wg = s(R c — 1) + 1 is the rate of binary woven graph code. Maximizing (|20l over < 7 < 1 gives 

5 



7op , = min^l, r _ 2(Rwg _ 1)/s 



Inserting 7opi < 1 and ^ opt = 1 into (1201 we obtain 

£m_J h(S) + R wg -l, if < < 1 - 2 C«-«-D/- 

(1 - *)/»(<*) - 5s log 2 (2-(«w g -i)/ S _ 1) , if 5 > 1 - 2( fi -- 1 )/ s { ' 

which coincides with (9) and (10) in [3] for s = 2, that is, if the graph is bipartite. 

For any R wg and S from F(6) < 0, it follows that there exist codes of rate R wg with relative minimum distance 
&wg = S. Let 5(R wg ) denote the solution of the equation 

F(5) = (22) 

for < 5 < 1 - 2^-^/ s and let 5 VG (R wg ) be the solution of h(5) + i? wg - 1 = 0. Solving © for 7op t < 1 
and 7o P i = 1 we obtain that there exist woven graph codes of rate R wg with the relative minimum distance S wg 
satisfying the inequalities: 

f <5(-Rw g ) - e, if ,R W g > 1 + slog 2 (l - 5 VG (R wg )) 
wg " 1 <Vg(#w S ) - e, if # W g < 1 + fllog 2 (l - 5 YG (R wg )). ^ 
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Fig. 8. The relative minimum distance as a function of the code rate for the ensemble of binary woven graph codes with block constituent 
codes. 

In Fig. [8] the lower bound (fl4l ) on the relative minimum distance for the ensemble of binary woven graph codes 
with block constituent codes as a function of the code rate is shown. It is easy to see that when s grows the 
ensemble of binary woven graph codes contains codes meeting the VG bound for almost all rates < R wg < 1. 
Fig. [9] demonstrates the gap i?vc — ^ W g between the VG bound and the code rate as a function of the relative 
minimum distance <5 wg for different values of s. It follows from Fig. [9] that for s > 3 the difference in code rate 
compared to the VG bound is negligible. 
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B. Asymptotic bound on the free distance of woven graph codes with constituent convolutional codes 

Consider a ZT convolutional woven graph code with constituent ZT convolutional codes of rate R c = b/c. The 
length of a ZT woven graph codeword in nc-tuples is equal to I + m wg where I is the number of nc-tuples influenced 
by information symbols and m wg is the memory of the woven graph code of rate R wg = s(R c — 1) + 1. Denote 
by dJJL the free distance of the corresponding woven graph code. 

Now we can prove the following 

Theorem 5: (Costello lower bound) For any e > 0, some mo > 0, some integer s > 2, and for all m wg > ttiq 
in the random ensemble of rate R wg = s(R c — 1) + 1 woven graph codes over s-partite, s-uniform, c-regular 
hypergraphs with constituent convolutional codes of rate R c = b/c there exists a code with memory m wg such that 
its relative free distance <5^g e = dj r ^ e /ncm wg satisfies the Costello lower bound [8], 

A w § > e (74) 

Proof. Analogously to the derivations in the proof of Theorem |4] let j = (j l5 j 2 , . . . ,j s ) where jj denotes the 
number of nonzero constituent codewords in the ith subgraph corresponding to the codeword of weight w, ji £ 
{1, ...,n}. In order to evaluate the number of nonzero constituent codewords among the n constituent codewords, 
notice that the set of such codewords is a union of sets of nonzero constituent codewords belonging to each of 
the s subgraphs. The cardinality of the union is at least j max = maxjjjj}. Therefore the all-zero "tail" required to 
force the encoder into the zero state has length at least j ma , x cm wg . The total number of redundant symbols consists 
of two parts: the number Ylt=i ji c K^ ~ °f parity-check symbols for the nonzero constituent codewords in the 
s subgraphs and at least j max cm wg redundant symbols required for zero-tail terminating of the woven graph code. 
Thus, formula (fT6l) can be rewritten as 

P(vH^ = 0\w,j) < ( nc(Z + mwg) ) ^f[2-^- R ^ 2-*-"*-. (25) 



The statement of the Lemma Q] is changed in a following way 

w/ji I Vji-1 



^M^n w ^iLi,, ' <26) 



1=1 
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Instead of ( fT9l ) we now have 

1-s 

/ n / - i i -J- 1 1 1 i \ 

P(vtfl = 0H < (n+l) s 



it) 

x . T {( 2 ----(;!)C (i :r ) ) J G': 1 1 ))' 2 -"i (27) 



By introducing the notations 

x w I j 

ncm wg m wg n 

we obtain from d2"51)-(l27l) that 

log 2 P(u# T = 0|io) 



= lim 



m w „— >oo 



ncm wg 



< max i (1 - + ( y^J - j(l + fi - fiR wg ) + 7 (1 + j^^- 
Maximizing d28l) over < 7 < 1, we obtain 



(28) 



7 ^ = min { 1 ' (l + ^(l-2-») } (29) 

where 

1 + /i(l - i? W g) 

If s is large enough, then 7 opi = 1. It follows from d28l) that 

F(<5) < (1 + (1^) - 1 - M + A g , (30) 
Maximization of F(<5) over /x gives 

F opt (5) < -5 log 2 (2 1 - R - - 1) - i? wg (31) 

where 

_ -, 

Vopt - 1 _ 2Rwg _! 1. 

We can find a bound on by solving F opt {5) = 0. Thus, we can conclude that for any e > we can find a 
woven graph code such that (1241) holds ■ 



IV. Example 

We start with considering a graph code determined by the parity-check matrix (f3]). As mentioned before, the 
matrix (f3]) can be considered as a 14 x 21 parity-check matrix. Since the parity checks defined by the graph are 
linearly dependent (the sum of the rows of ® is equal to zero) it turned out that by ignoring one parity check we 
obtain a parity-check matrix of a (21, 8) linear block code. For simplicity we consider the rate R g = 1/3 code that 
is obtained by ignoring the eighth information symbol which yields a (21,7) subcode of this code. 

It is easy to see that renumbering the graph vertices by adding to each vertex number some fixed number modulo 
the total number of vertices preserves both the incidence and adjacency matrices of the graph. For example, in Fig. 
[3l by adding 2 modulo 14 we will get exactly the same graph. When we have a similar property for linear codes we 
call such codes quasi-cyclic codes and these block codes can be described as tailbiting (TB) convolutional codes. 
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Renumbering the vertices corresponds to permuting the rows of (f3]). By row permutations, © can be reduced to 
the form 
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(32) 



It follows from d32l that the graph shown in Fig. [3] corresponds to a (21, 7) TB code with a parent convolutional 
code determined by the parity-check matrix 



1 1 1 

1 D D 3 



(33) 



It means that this code is "tail-bitten" at the 7th level of the trellis diagram. A corresponding polynomial generator 
matrix of the parent convolutional code has the form 



Gconv(£>) = ( D + D 2 1 + D + D 2 1). 



(34) 



The minimum distance of the (21,7) TB code is equal to the graph girth, that is, d m - m = g = 6. 

Notice that many regular bipartite graphs look very similar to the Heawood graph in the sense that by manipulating 
the incidence (parity-check) matrices and truncating lengths we can obtain infinite families of graphs. Some 
properties of these graphs can be easily predicted from the properties of the corresponding parent convolutional 
codes. 

Consider the parity-check matrix (fTOb of the woven graph code based on the Heawood bipartite graph with 
constituent convolutional codes of rate R c = 2/3. This woven graph code has the rate R wg = 2/3-2 — 1 = 1/3. 

Let the rate R c = 2/3 constituent convolutional code of memory m = 3 and overall constraint length v c = 5 
with c?£ ree = 6 be given by the generator matrix 

1 + D 2 
D + D 2 + D 3 



G C (D) 

A corresponding parity-check matrix H C (D) 

H C {D) = 



D 2 
1 



1 + D + D 2 
1 + D 2 



(35) 



(36) 



1 + D + D 4 
1 + D + D 3 + D A + D 5 
K l + D 2 + D 3 + D A + D l 

Notice that the constituent code C c considered as a block code over W((D)) represents a (3,2) block code with 



the minimum distance d 



block 



By using the product-type lower bound (TTTb we obtain 



3 x 6 = 18. 



On the other hand, it was verified by computer search that any codeword of the woven graph code determined by 
(fTOb consists of at least three nonzero codewords of the component code C c described by (|36l ). Moreover, it was 
found by computer search that each of these nonzero codewords of C c has the minimum block weight c?biock = 2- 
Note that the codewords of the block code over F((_D)) with block weight cf£ lock = 2 corresponds to the codewords 
of the convolutional code belonging to its subcodes of rate R c = 1/2. These three subcodes have generator matrices 



GOD) 
G C 2 (D) 



gl(D) 9 c{D)) 
g c 3 (D) g c 2 (D)) 
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G C 3 (D) = [g^D) g c 3 (D)) 

where g\(D) = 1 + D + D 3 + D A + L> 5 , g§ (£>) = 1 + 7J + D A , and gf (D) = 1 + D 2 + 7J 3 + D A + L> 5 . 

The minimum free distance over all these subcodes of rate R c = 1/2 is equal to 8. Taking into account that all 
other codewords of the woven graph code contain at least four nonzero codewords of C c of block weight d^iock = ^ 
we obtain an improved lower bound on the free distance of the woven graph code as df ree > min{3 x 8, 4 x 6} = 24. 

In order to obtain an upper bound on the free distance of the woven graph code we consider the parity-check 
matrix (fTOt in more detail. It also describes a quasi-cyclic code and can by row permutations be reduced to a 
parity-check matrix of a two-dimensional code, a TB (block) code in one dimension and a convolutional code in 
the other, 
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(37) 



H(D,Z) 



(38) 



A parity-check matrix of the parent convolutional code for the TB code (l37l ) given in symbolic form is 

h\{D) h c 2 {D) h c 3 (D) 

q(D) q(D)z q(D)z* 

where Z and D are formal variables. The matrix d38l can be considered as a parity-check matrix of a two- 
dimensional convolutional code. The variable Z corresponds to the parent convolutional code of the Heawood 
graph code (|33l , the variable D is used for the constituent convolutional code d3"6l ). 

A generator matrix of the two-dimensional convolutional code with the parity-check matrix d38l has the form 

G(D,Z) = (g c e (D)Z + g c c (D)Z 3 g c a {D) + g c d (D)Z 3 g c b (D) + g){D)Z ) (39) 

where g c a (D) = h%(D)ti(D), g c b (D) = h c 2 {D)t\{D), g c c (D) = h c 2 (D)t c 3 (D) , gg(Z>) = h<i(D)t c 3 (D), g c e (D) = 
h c 3 (D)t c 2 (D), and gj(D) = h\{D)t c 2 {D). 

The generator matrix d39l ) tail-bitten over variable Z at length 21 yields the generator matrix G{D) of the code 
d37]>, 
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(40) 



where g| is short-hand for gf(D). 

Notice that any of the six permutations of the columns h^(D), i = 1,2,3, generates a woven graph code. The 
permutation t\(D) = h\{D), t 2 (D) = h 3 (D), and t 3 (D) = h 2 {D) describes the woven graph code with the largest 
free distance. The overall constraint length of this generator matrix is equal to 70 but the matrix is not in minimal 
form. A minimal-basic generator matrix [8] has the overall constraint length equal to 64 and differs from (|40l by 
one row which can replace any of the rows of G(D) and has the form 



G (D) G (D) G Q (D) G (D) G (D) G (D) G (D)) 



where 



G (D) = { g ;(D) g c q (D)gC(D) 
where g c (D) = D + D 2 and g c JD) = l + D + D A . 
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The matrix d40l is a generator matrix of a convolutional code of rate R wg = 7/21. By applying the BEAST 
algorithm [19] to the minimal-basic generator matrices corresponding to the different permutations of the columns 
hf(D), i = 1,2, 3, we obtained the free distance and a few spectrum coefficients of the corresponding woven graph 
codes. The parameters of the best obtained woven graph codes are presented in Table 1. 

TABLE I 

Spectra and overall constraint lengths of rate i? wg = 1/3 woven graph codes 



Permutation 


V 




Spectrum 


ht(D),h c 3 (D),h c 2 (D) 


64 


32 


7,0,0,0,0,0,7,0,7,0. . . 


h c 2 (D),hi(D),h c 3 (D) 


65 


32 


7,0,0,0,7,0,0,0,21,0. . . 


h c 2 (D),h c 3 (D),hi(D) 


66 


30 


7,0,0,0,0,0,0,0,7,0. . . 



V. Encoding 

Generally speaking, encoding of graph-based block codes has complexity 0(N 2 ), where N is the blocklength. 
This technique implies that we find a generator matrix corresponding to the given parity-check matrix and then 
multiply the information sequence by the obtained generator matrix. However, we showed by examples that some 
regular graph codes as well as woven graph codes are quasi-cyclic codes and thereby they can be interpreted as TB 
codes. For this class of codes the complexity of the encoding is proportional to the constraint length of the parent 
convolutional code. 

In this section we are going to illustrate by an example an encoder of a woven graph code with constituent 
convolutional codes having encoding complexity proportional to the overall constraint length of the corresponding 
woven graph code u wg < nsv c . 

Consider again the woven graph code in our example. It is based on the Heawood graph and uses constituent 
convolutional codes of rate R c = 2/3 and overall constraint length v c = 5. Taking into account the representation 
(l40l of the woven graph code as a rate R wg = 7/21 two-dimensional code, a TB (block) code in one dimension 
and a convolutional code in the other, we can draw its encoder as shown in Fig. [TOj 
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Fig. 10. An encoder of the two-dimensional woven graph code. 



The input symbols u±,U2, ■ ■ ■ ,117 enter the encoder once per each cycle of duration seven time instants. At each 
time moment the contents of each register is rewritten into the next (modulo seven) register and the three output 
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symbols v\,V2,vs are generated. In other words, each of the registers corresponding to the constituent code can 
be considered as an enlarged delay element of the encoder of the "TB -dimension" code determined by the graph. 
The sequence ui , U2, ■ ■ ■ , u? determines a transition between the states of this encoder. After a cycle of seven time 
instants we return to the starting state of the enlarged encoder and a TB -codeword (or a word from one of its 
cosets) of length 2 1 has been generated. Then the following seven input symbols us , ug , . . . , uu enter and after 
seven time instants another word of length 21 has been generated, etc. 

VI. Conclusion 

The asymptotic behavior of the woven graph codes with block as well as with convolutional constituent codes 
has been studied. It was shown that in the random ensemble of such codes based on s-partite, s-uniform, c-regular 
hypergraphs we can find a value s > 2 such that for any code rate there exist codes meeting the VG and the 
Costello lower bound on the minimum distance and free distance, respectively. Product-type lower bounds on the 
minimum distance of graph-based and woven graph codes have been derived. Example of a rate R wg = 1/3 woven 
graph code with free distance above the product bound is presented. It is shown, by an example, that woven graph 
codes can be encoded with a complexity proportional to the constraint length of the constituent convolutional code. 
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